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Abstract SARS-CoV-2 has caused tens of thousands of infections and more than one thousand deaths. 
There are currently no registered therapies for treating coronavirus infections. Because of time consuming 
process of new drug development, drug repositioning may be the only solution to the epidemic of sudden 
infectious diseases. We systematically analyzed all the proteins encoded by SARS-CoV-2 genes, 
compared them with proteins from other coronaviruses, predicted their structures, and built 19 structures 
that could be done by homology modeling. By performing target-based virtual ligand screening, a total of 
21 targets (including two human targets) were screened against compound libraries including ZINC drug 
database and our own database of natural products. Structure and screening results of important targets 
such as 3-chymotrypsin-like protease (3CLpro), Spike, RNA-dependent RNA polymerase (RdRp), and 
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papain like protease (PLpro) were discussed in detail. In addition, a database of 78 commonly used anti¬ 
viral drugs including those currently on the market and undergoing clinical trials for SARS-CoV-2 was 
constructed. Possible targets of these compounds and potential drugs acting on a certain target were pre¬ 
dicted. This study will provide new lead compounds and targets for further in vitro and in vivo studies of 
SARS-CoV-2, new insights for those drugs currently ongoing clinical studies, and also possible new stra¬ 
tegies for drug repositioning to treat SARS-CoV-2 infections. 

© 2020 Chinese Pharmaceutical Association and Institute of Materia Medica, Chinese Academy of Medical 
Sciences. Production and hosting by Elsevier B.V. This is an open access article under the CC BY-NC-ND license 
(http://creativecommons.Org/licenses/by-nc-nd/4.0/). 


1. Introduction 

Coronaviruses (CoVs) have caused a major outbreak of human 
fatal pneumonia since the beginning of the 21st century. Severe 
acute respiratory syndrome coronavirus (SARS-CoV) broke out 
and spread to five continents in 2003 with a lethal rate of 10% 12 . 
The Middle East Respiratory Syndrome Coronavirus (MERS- 
CoV) broke out in the Arabian Peninsula in 2012 with a fatality 
rate of 35% 3 ' 4 . Both SARS-CoV and MERS-CoV are zoonotic 
viruses, and their hosts are bat/civet and dromedary, respec¬ 
tively 5,6 . To date, no specific therapeutic drug or vaccine has been 
approved for the treatment of human coronavirus. Therefore, 
CoVs are considered to be a kind of viruses, of which the outbreak 
poses a huge threat to humans. The novel coronavirus found at the 
end of 2019 was named as 2019 novel coronavirus or “2019- 
nCoV” by the World Health Organization (WHO) on January 
12, 2020 7 ' 8 . Since 2019-nCoV is highly homologous with SARS- 
CoV, it is considered a close relative of SARS-CoV. The Inter¬ 
national Virus Classification Commission (ICTV) classified 2019- 
nCoV as Severe Acute Respiratory Syndrome Coronavirus 2 
(SARS-CoV-2) on February 11, 2020. At the same time, WHO 
named the disease caused by 2019-nCoV as COVID-19. Common 
symptoms of a person infected with coronavirus include respira¬ 
tory symptoms, fever, cough, shortness of breath, and dyspnea. In 
more severe cases, infection can cause pneumonia, severe acute 
respiratory syndrome, kidney failure, and even death. There is 
currently no specific medicine or treatment for diseases caused by 
SARS-CoV-2 9 . 

CoVs are enveloped viruses with a positive RNA genome, 
belonging to the Coronaviridae family of the order Nidovirales, 
which are divided into four genera (a, f}, y, and o). The SARS- 
CoV-2 belongs to the fl genus. CoVs contain at least four struc¬ 
tural proteins: Spike (S) protein, envelope (E) protein, membrane 
(M) protein, and nucleocapsid (N) protein 10 . Among them, Spike 
promotes host attachment and virus—cell membrane fusion during 
virus infection. Therefore, Spike determines to some extent the 
host range. 

Potential anti-coronavirus therapies can be divided into two 
categories depending on the target, one is acting on the human 
immune system or human cells, and the other is on coronavirus 
itself. In terms of the human immune system, the innate immune 
system response plays an important role in controlling the repli¬ 
cation and infection of coronavirus, and interferon is expected to 
enhance the immune response 11 . Blocking the signal pathways of 
human cells required for virus replication may show a certain anti¬ 
viral effect. In addition, viruses often bind to receptor proteins on 
the surface of cells in order to entering human cells, for example, 
the SARS virus binds to the angiotensin-converting enzyme 2 
(ACE2) receptor 12-14 and the MERS binds to the DPP4 


receptor 15 ' 16 . The therapies acting on the coronavirus itself include 
preventing the synthesis of viral RNA through acting on the ge¬ 
netic material of the virus, inhibiting virus replication through 
acting on critical enzymes of virus, and blocking the virus binding 
to human cell receptors or inhibiting the virus’s self-assembly 
process through acting on some structural proteins. 

In the fight against coronavirus, scientists have come up with 
three strategies for developing new drugs 1 . The first strategy is 
to test existing broad-spectrum anti-virals 18 . Interferons, ribavirin, 
and cyclophilin inhibitors used to treat coronavirus pneumonia fall 
into this category. The advantages of these therapies are that their 
metabolic characteristics, dosages used, potential efficacy and side 
effects are clear as they have been approved for treating viral 
infections. But the disadvantage is that these therapies are too 
“broad-spectrum” and cannot kill coronaviruses in a targeted 
manner, and their side effects should not be underestimated. The 
second strategy is to use existing molecular databases to screen 
for molecules that may have therapeutic effect on coronavirus 19 ’ 20 . 
High-throughput screening makes this strategy possible, and new 
functions of many drug molecules can be found through this 
strategy, for example, the discovery of anti-HIV infection drug 
lopinavir/ritonavir. The third strategy is directly based on the 
genomic information and pathological characteristics of different 
coronaviruses to develop new targeted drugs from scratch. 
Theoretically, the drugs found through these therapies would 
exhibit better anti-coronavirus effects, but the research procedure 
of new drug might cost several years, or even more than 10 

For the development of medicines treating SARS-CoV-2, the 
fastest way is to find potential molecules from the marketed drugs. 
Once the efficacy is determined, it can be approved by the Green 
Channel or approved by the hospital ethics committee for rapid 
clinical treatment of patients. Herein, bioinformatics analysis on 
the proteins encoded by the novel coronavirus genes was sys¬ 
tematically conducted, and the proteins of SARS-CoV-2 were 
compared with other coronaviruses, such as SARS-CoV and 
MERS-CoV. We conducted homology modeling to build all 
possible protein structures, including viral papain like protease 
(PLpro), main protease (3CLpro, also named 3-chymotrypsin-like 
protease), RNA-dependent RNA polymerase (RdRp), helicase, 
Spike, etc. Further, we used these proteins and human relative 
proteins [human ACE2 and type-II transmembrane serine protease 
(TMPRSS2) enzymes] as targets to screen ZINC U. S. Food and 
Drug Administration (FDA)-approved drug database (ZINC drug 
database, ZDD), our own database of traditional Chinese medicine 
and natural products (including reported common anti-viral 
components from traditional Chinese medicine), and the data¬ 
base of commonly used anti-viral drugs (78 compounds) by virtual 
ligand screening method. This study predicts a variety of 
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compounds that may inhibit novel coronaviruses and provides 
scientists with information on compounds that may be effective. 
Subsequent validation of anti-viral effects in vitro and in vivo will 
provide useful information for clinical treatment of novel coro- 
navirus pneumonia. 

2. Methods and materials 

2.1. Homology genome blast and genomes information 

The complete genome of SARS-CoV-2/WHU02 (MN988669.1) 
was downloaded from NCBI nucleotide database. Thenucleotide 
sequences were aligned with whole database using BLASTn 
to search for homology viral genomes(Alogorithm parame¬ 
ters, max target sequences: 1000, expect threshold: 10). 
Accession numbers of 22 sequences in GenBank are listed as 
follows: BetaCoV/YN2018B(MK211376.1), Human/CoV HKU1 
isolate SI17244 (MH940245.1), Human/CoV HKU1 genotype 
B(DQ415911.1), Rodent/RtTn-CoV (KY370043.1), DUCK/CoV 
(KM454473.1), Feline/CoV UU5(FJ938056.1), Bat/CoV HKU9-4 
(EF065516.1), MERS NL13845 (MG021451.1), SARS-CoV-2/ 
WHU02 (MN988669.1), 2019-nCoV/USA-ILl(MN988713.1), 
SARS-CoV-2/USA-CA2 (MN994468.1), SARS-CoV-2/HKU-SZ- 
005b (MN975262.1), SARS-CoV-2/USA-WAl/2020 (MN985325.1), 
SARS-CoV-2/USA-AZl/2020 (MN997409.1), SARS-CoV-2/USA- 
CA1 (MN994467.1). 

2.2. Nucleotide and amino acid sequence alignment and 
analysis 

Nucleotide sequence editing was conducted using Bioedit and 
DNAMAN, and sequence alignment was conducted using DNA- 
MAN and ClustaTW. The evolutionary history was inferred using 
the Neighbor-Joining method in MEGA seven software package. 
The percentage of replicate trees in which the associated taxa 
clustered together in the bootstrap test was determined by 500 
replicates. Protein sequence management and analysis were car¬ 
ried out by using snap gene view, and sequence alignment was 
performed using Clustalw method in Jalview software. The second 
structure of protein sequences were predicted by Jpred4. 

The homology model prediction was carried out through 
searching in PDB1018 database included in Fold and Function 
Assignment System (ffas.godziklab.org) 21 . Prediction of trans¬ 
membrane helices in proteins was carried out in TMHMM Server 
v. 2.0 online (http://www.cbs.dtu.dk/services/TMHMM/). 3D 
structure structures are aligned by pymol structure alignment tool. 

2.3. Compounds database 

Approved drug database was from the subset of ZINC database, ZDD 
(ZINC drug database) containing 2924 compounds"". Natural prod¬ 
ucts database was constructed by ourselves, containing 1066 chem¬ 
icals separated from traditional Chinese herbals in own laboratory 
and naturally occurring potential anti-viral components and de¬ 
rivatives. Anti-viral compounds library contains 78 known anti-viral 
drugs and reported anti-viral compounds through literature search. 

2.4. Homology modeling and molecular docking 

Corresponding homology models predicted by Fold and Function 
Assignment System server for each target protein were 


downloaded from Protein Data Bank (www.rcsb.org). Alignment 
of two protein sequences and subsequent homology modeling 
were performed by bioinformatics module of ICM 3.7.3 modeling 
software on an Intel i7 4960 processor (MolSoft LLC, San Diego, 
CA, USA). For the structure-based virtual screening, ligands were 
continuously resiliently made to dock with the target that was 
represented in potential energy maps by ICM 3.7.3 software, to 
identify possible drug candidates. 3D compounds of each database 
were scored according to the Internal Coordinate Mechanics (In¬ 
ternal Coordinate Mechanics, ICM) 23 . Based on Monte Carlo 
method, stochastic global optimization procedure and pseudo- 
Brownian positional/torsional steps, the position of intrinsic mo¬ 
lecular was optimized. By visually inspecting, compounds outside 
the active site, as well as those weakly fitting to the active site 
were eliminated. Compounds with scores less than —30 or 
mfScores less than —100 (generally represents strong interactions) 
have priority to be selected. Protein—protein docking was fol¬ 
lowed the manual of ICM software. 

3. Results 

3.1. Analysis, structure prediction and homology modeling of 
SARS-CoV-2 encoded proteins 

We obtained the SARS-CoV-2 genome from Gene Bank. The 
genome sequence of SARS-CoV-2/WHU02 was aligned with 
whole database using BLASTn to search for homology viral ge¬ 
nomes. After phylogenetic analysis and sequence alignment of 22 
coronaviruses from various species. We found three coronaviruses 
from bat (96%, 88% and 88% for Bat-Cov RaTG13, bat-SL- 
CoVZXC12 and bat-SL-CoVZC45, respectively) have the highest 
genome sequence identity to SARS-CoV-2 (Fig. 1A). Moreover, 
as shown in Fig. IB, Bat-Cov RaTG13 exhibited the closest 
linkage with SARS-CoV-2. Among all coronaviruses from human, 
SARS-CoV (80%) exhibited the highest genome sequence identity 
to SARS-CoV-2. And MERS/isolate NL13845 also has 50% 
identity with SARS-CoV-2. SARS-CoV is the most clearly studied 
one among all these viruses according to previous literatures. 
Structure and function of its most genome encoded proteins have 
been elucidated in recent years. In this study, because of high 
genome identity between SARS-CoV-2 and SARS-CoV, the 
structure and function prediction of SARS-CoV-2 genome enco¬ 
ded protein were mainly based on those researches on homology 
protein in SARS-CoV. SARS-CoV-2 genome has 10 open reading 
frames (Fig. 2A). ORFlab encodes replicase polyprotein 1 ab. 
After cleaved by two proteases, replicase proteins showed multi¬ 
function involved in transcription and replication of viral RNAs. 
ORF2-10 encodes viral structural proteins such as S, M, N, and E 
proteins, and other auxiliary proteins. The S, M, E proteins are 
involved in the formation of the viral coat, and the N protein is 
involved in the packaging of the RNA genome (Fig. 2C). By 
aligning with the amino acid sequence of SARS PPlab and 
analyzing the characteristics of restriction cleavage sites recog¬ 
nized by 3CLpro and PLpro, we speculated 14 specific proteolytic 
sites of 3CLpro and PLpro in SARS-CoV-2 PPlab (Fig. 2B). 
PLpro cleaves three sites at 181—182, 818—819, and 

2763—2764 at the N-terminus and 3CLpro cuts at the other 11 
sites at the C-terminus, and forming 15 non-structural proteins. 
Among them, Nsp3 contains multiple domains, including a 
segment of SARS unique domain and a deubiquitination and 
proteolytic enzyme PLpro. Nsp5 is 3CLpro, Nspl2 is an RdRp, 
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Figure 1 Phylogenetic analysis and sequence alignment of 22 coronaviruses from different hosts. (A) Homology tree of 22 nucleotide se¬ 
quences alignment. The homology of each branch is marked in red. (B) The evolutionary history was inferred using the Neighbor-Joining method. 
The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (500 replicates) is shown above the branches. 


and Nspl3 is helicase. As a new coronavirus, structure biology 
study about these proteins still at early stage. Until now, only one 
crystal structure of 3CLpro has been deposited in PDB (pdb code: 
6LU7). 

In order to acquire more three-dimensional structure infor¬ 
mation of proteins about these new coronaviruses for subsequent 
drug screening, we aligned all protein sequences from SARS- 
CoV-2 with all sequences in PDB 1018 database in Fold & 
Function Assignment System (Supporting Information Figs. 
S19—S34). Fortunately, most of these proteins have found their 
high homology proteins that have three-dimensional structure, 
with homology between 72%—99% (Supporting Information 
Table SI). Those PDB codes were labeled below the corre¬ 
sponding sequences in Fig. 2B. Unsurprisingly, all these proteins 
with the highest homology are from SARS. Nevertheless, there are 
some proteins still have not high homologous in the database. 
After the prediction of transmembrane helices in these proteins 
carried out in TMHMM Server, as expected, we found that all 
these proteins are transmembrane proteins except for Nsp2 
(Supporting Information Figs. S35—S41). So far, we have found 
as much structural information of this viral proteins as possible, 
which provides the basis for subsequent homology modeling and 
drug screening. Before homology modeling, all these proteins 
sequences were aligned with model sequences derived from 
SARS-CoV and predicted secondary structure as the same time 
(Fig. 3A and Table SI). Interestingly, as shown in Fig. 3A, 
important anti-virus drug target protein like 3CLpro, PLpro, and 
RdRp are highly conserved between those two human coronavi¬ 
ruses, especially in functional region. As shown in Table SI, all 
these potential drug target proteins have been homologously 
modeled, and all generated protein models were provided as the 


PDB files in Supporting Information. All other coordinates of 
target-screening hit complexes can be provided upon request. 

In order to verify the accuracy of homologous modeling, we 
aligned the computational structure of the SARS-CoV-2 3CLpro 
that modeled from the SARS-CoV 3CLpro with its crystal struc¬ 
ture of SARS-CoV-2 3CLpro (6LU7) just solved and released 
during this manuscript was being prepared. The computational 
model of SARS-CoV-2 3CLpro showed a Ca RMSD of 0.471 A 
on the overall structure compared to the SARS-CoV-2 3CLpro 
structure (Fig. 3B). What’s more, the Ca RMSD between their 
substrate binding regions is only 0.126 A, showing very subtle 
difference. 

ACE2 molecule was known as a human entry receptor for 
Spike which facilitates its cross-species transmission. Sequence 
alignment results show that the homology of the Spike-receptor 
binding domain (RBD) sequence between SARS-CoV-2 and 
SARS-CoV is 76% (Fig. 3C). Recent researches speculated that 
SARS-CoV-2 could also bind to ACE2, and this was verified by 
computational docking and ELISA measurement 24,25 . Moreover, 
homology of the Spike-RBD sequence between SARS-CoV-2 and 
Bat-CoV RaTG13 is as high as 95%. Despite the high homology 
of RaTG13 and SARS-CoV-2 in Spike sequence, our analysis 
found that four among the five most important amino acids (L465, 
L495, Y502, D510, and H514) that bind to ACE2 12 in Bat-CoV 
RaTG13 differ from SARS-CoV-2 (Fig. 3C). And there is no 
related research literature about whether Bat-CoV RaTG13 can 
infect human yet. We also performed homology modeling on the 
Bat-CoV RaTG13 Spike RBD (Supporting Information Fig. SI). 
Three Spike RBD structures have been docked with human ACE2. 
Among them, for the conformations which most resemble the 
crystal structure of SARS RBD—ACE2 complex 26 , the binding 
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Figure 2 Overall genome and protein analysis of SARS-CoV-2. (A) Genome of SARS-CoV-2. (B) The large replicase polyprotein PPlab. The 
purple rectangle represents the replicase polyprotein PPlab which contains 7096 amino acids. The position of red triangle means cleavage site of 
PLpro, and position of yellow triangle means the site of 3CLpro, the numbers below the purple rectangle represent the residues that bound the 
individual proteins or domains. Orange rectangle below the individual proteins or domains means exactly homology 3D structure were detected. 
(C) Four structure proteins are represented in the cartoon pattern of coronavirus. 


free energy between SARS-CoV-2 Spike RBD and human ACE2 
was —33.72 kcal/mol (Supporting Information Fig. S2), that be¬ 
tween SARS-CoV Spike RBD and ACE2 was —49.22 kcal/mol 
(Supporting Information Fig. S3), and that between Bat-CoV 
RaTG13 Spike RBD and ACE2 was —31.06 kcal/mol (Support¬ 
ing Information Fig. S4). 

3.2. Virtual ligand screening of potential drug targets ofSARS- 

CoV-2 

The therapies that act on the coronavirus can be divided into 
several categories based on the specific pathways: (1) some acting 
on enzymes or functional proteins that are critical to virus, pre¬ 
venting the virus RNA synthesis and replication; (2) some acting 
on structural proteins of virus, blocking virus from binding to 
human cell receptors, or inhibiting the virus’s self-assembly pro¬ 
cess; (3) some producing virulence factor to restore host’s innate 
immunity; (4) some acting on host’s specific receptors or en¬ 
zymes, preventing virus from entering into host’s cells. The 
related target proteins include Nspl, Nsp3 (Nsp3b, Nsp3c, PLpro, 


and Nsp3e), Nsp7_Nsp8 complex, Nsp9—NsplO, and 
Nspl4—Nspl6, 3CLpro, E-channel (E protein), ORF7a, Spike, 
ACE2, C-terminal RNA binding domain (CRBD), N-terminal 
RNA binding domain (NRBD), helicase, RdRp, and TMPRSSS2. 

Based on homology models of the above 18 viral proteins (19 
models) and two human targets, we resorted to structure-based 
virtual ligand screening method using ICM 3.7.3 modeling soft¬ 
ware (MolSoft LLC) to screen potential small-molecule com¬ 
pounds from a ZINC Drug Database (2924 compounds) and a 
small in-house database of traditional Chinese medicine and nat¬ 
ural products (including reported common anti-viral components 
from traditional Chinese medicine) and derivatives (1066 com¬ 
pounds). Compounds with lower calculated binding energies 
(being expressed with scores and mfScores) are considered to 
have higher binding affinities with the target protein. 

3.2.1. Targets preventing the virus RNA synthesis and 
replication 

As significant functional proteins of coronavirus, Nsps are 
involved in RNA transcription, translation, protein synthesis, 
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Figure 3 Sequence alignment of the proteins from SARS-CoV-2 with other CoVs. (A) Sequence alignment of PLpro, 3CLpro, RdRp among 
SARS-CoV, SARS-CoV-2 and MERS CoV. (B) The computational structure of SARS-CoV-2 3CLpro (shown as pink cartoon) modeled from 
SARS 3CLpro (3WA0) aligned with crystal structure of SARS-CoV-2 3CLpro (6LU7) (shown in blue cartoon). The computational model of the of 
SARS-CoV-2 3CLpro showed a Ca RMSD of 0.471 A on the overall structure compared to the SARS-CoV-2 3CLpro structure, and that showed a 
Cot RMSD of 0.126 A on the substrate binding region. (C) Sequence alignment of spike protein receptor binding domain among SARS-CoV, bat- 
CoVRaTG13 and SARS-CoV-2. “ A ” represents residues previously identified as importantly interact with human ACE2 from SARS. Black arrow 
marked amino acids that are different between SARS-CoV-2 and bat-CoVRaTG13. 
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processing and modification, virus replication and infection of the 
host. Among them, 3CLpro, PLpro, RdRp and helicase are the 
most important targets for the development of small-molecule 
inhibitors due to their clear biological functions and vital enzyme 

3.2.1.1. Papain-like proteinase (PLpro) 

PLpro is responsible for the cleavages of N-terminus of the 
replicase poly-protein to release Nspl, Nsp2 and Nsp3, which is 
essential for correcting virus replication 27 . PLpro was also 
confirmed to be significant to antagonize the host’s innate 
immunity 28- ’ 0 . As an indispensable enzyme in the process of 
coronavirus replication and infection of the host, PLpro has been a 
popular target for coronavirus inhibitors. It is very valuable for 
targeting PLpro to treat coronavirus infections, but no inhibitor 
has been approved by the IDA for marketing. 

The screening results (Table 1 and supporting excel file 
PLpro.xlsx) showed that a series of anti-virus drugs (ribavirin, 
valganciclovir and thymidine), anti-bacterial drugs (chloram¬ 
phenicol, cefamandole and tigecycline), muscle relaxant drug 
(chlorphenesin carbamate), anti-tussive drug (levodropropizine) 
may have high binding affinity to PLpro. The natural products 
(Table 2 and Supporting excel file PLpro_NP.xlsx), such as pla- 
tycodin D from Platycodon grandiflorus, baicalin from Scutellaria 
baicalensis, sugetriol-3,9-diacetate from Cyperus rotundas, phai- 
tanthrin D and 2,2-di (3-indolyl)-3-indolone from I satis indigo- 
tica, catechin compounds ((—)-epigallocatechin gallate and 2- 
(3,4-dihydroxyphenyl)-2-[[2-(3,4-dihydroxyphenyl)-3,4-dihydro- 
5,7-dihydroxy-277-1 -benzopyran-3-yl]oxy]-3,4-di hydro-277-1 - 
benzopyran-3,4,5,7-tetrol) exhibited high binding affinity to 
PLpro protein, suggesting the potential utility of these compounds 
in the treatment of SARS-CoV-2. 

Anti-viral drug ribavirin was predicted to bind to PLpro with 
low binding energy (Scores = —38.58). From generated docking 
model, ribavirin was bound in the active site of the enzyme as 
reported SARS-PLpro inhibitors (PDB code 3e9s). Hydrogen 
bonds were predicted between Glyl64, Gln270, Tyr274, Asp303 
and the compound. Also, it— it stacking was found between 
Tyr265 and triazolering in the compound (Fig. 4 A and B). The 
strong hydrogen bonding and hydrophobic interaction between 
ribavirin with the enzyme imply it may be a potent PLpro 
inhibitor. 

3.2.1.2. 3C-like main protease (3CLpro) 

The 3CLpro, also known as Nsp5, is first automatically cleaved 
from poly-proteins to produce mature enzymes, and then further 
cleaves downstream Nsps at 11 sites to release Nsp4—Nspl6 31 . 
3CLpro directly mediates the maturation of Nsps, which is 
essential in the life cycle of the virus. The detailed investigation 
on the structure and catalytic mechanism of 3CLpro makes 
3CLpro an attractive target for anti-coronavirus drug development. 
Inhibitors targeting at SARS-CoV 3CLpro mainly include peptide 
inhibitors and small-molecule inhibitors . 

3CLpro monomer has three domains, domain I (residues 
8—101), domain II (residues 102—184) and domain III (residues 
201—303), and a long loop (residues 185—200) connects domains 
II and III. The active site of 3CLpro is located in the gap between 
domains I and II, and has a CysHis catalytic dyad (Cysl45 and 


His41) . As shown in Table 3 and Supporting excel file 
3CLpro.xlsx), anti-bacterial drugs (lymecycline, demeclocycline, 
doxycycline and oxytetracycline), anti-hypertensive drugs (nicar¬ 
dipine and telmisartan), and conivaptan treating hyponatremia 
show highest binding affinity to 3CLpro. Several natural com¬ 
pounds and derivatives with anti-virus and anti-inflammatory ef¬ 
fects also exhibited high binding affinity to 3CLpro (Table 4 and 
Supporting excel file 3CLpro_NP.xlsx), including a series of 
andrographolide derivatives, chrysin-7-0-/3-glucuronide from S. 
baicalensis, betulonal from Cassine xylocarpa, 2/?-hydroxy-3,4- 
secofriedelolactone-27-oic acid, isodecortinol and cerevisterol 
from Viola diffusa, hesperidin and neohesperidin from Citrus 
aurantium, kouitchenside I and deacetylcentapicrin from the 
plants of Swertia genus. The above results suggest that these 
small-molecule compounds might be the potential 3CLpro in¬ 
hibitors and could probably be used for treating SARS-CoV-2. 

It’s worth mentioning, anti-asthmatic drug montelukast also 
showed low binding energy to 3CLpro. As shown in Fig. 5A, 
montelukast was well fitted into the active pocket of 3CLpro, in 
which lots of hydrophobic amino acids, just like Thr24, Leu27, 
His41, Phel40, Cysl45, Hisl63, Metl65, Prol68 and Hisl72 
compose a relatively hydrophobic environment to contain the 
compound and stabilize its conformation. Hydrogen bonding was 
predicted between Asnl42 and the carbonyl group of the com¬ 
pound (Fig. 5B). 


3.2.I.3. RNA-dependent RNA polymerase (RdRp) 

Nspl2, a conserved protein in coronavirus, is an RNA-dependent 
RNA polymerase (RdRp) and the vital enzyme of coronavirus 
replication/transcription complex. The RdRp domain of poly¬ 
merase is located at the C-terminus and has a conserved Ser-Asp- 
Asp motif 34 . Nsp8 can de novo synthesize up to six nucleotides in 
length, which can be used as a primer for Nspl2-RdRp RNA 
synthesis. Further, the Nsp7_Nsp8 complex increases the binding 
of Nspl2 to RNA and enhances the RdRps enzyme activity of 
Nspl2 35 . In the research of SARS-CoV and MERS-CoV in¬ 
hibitors, Nspl2-RdRp has been used as a very important drug 
target. In principle, targeted inhibition of Nspl2-RdRp could not 
cause significant toxicity and side effects on host cells, but no 
specific inhibitors have been found until now 36 . 

Virtual screening results of RdRp demonstrated some drugs might be 
potential inhibitors with mfScores lower than — 110, such as several anti¬ 
fungal drug itraconazole, anti-bacterial drug novobiocin, gallstone¬ 
dissolving drug chenodeoxycholic acid, anti-allergic chug cortisone, 
anti-tumor drag idarubicin, hepatoprotective drug silybin, muscle 
relaxant drug pancuronium bromide, and chronic enteritis, anti¬ 
coagulant chug dabigatran etexilate (Table 5 and Supporting excel file 
RdRp.xlsx). The natural products and derivatives with anti-virus, anti¬ 
inflammation and anti-tumor effects exhibited high binding affinity to 
RdRp, such as betulonal from C. xylocarpa, gnidicin and gniditrin from 
Gnidia lamprantha, 2/3,30/3-dihydroxy-3,4- 1 scco-friedelo]actone- 
27-lactone from V. diffusa, 14-deoxy-11,12-didehydroandrographolide 
from Andrographis paniculata, l,7-dihydroxy-3- methoxyxanthone 
from Swerti apseudochinensis, theaflavin 3,3'-di-0-gallate 
from Camellia sinensis, and andrographolide derivative (R)- 
((l/f,5aS,6R,9aS)-1,5a-dimethyl-7-methylene-3-oxo-6-((E)-2-(2-oxo- 
2,5-dihydrofuran-3-yl)ethenyl)decahydro-177-benzo[c]azepin-l- 
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Table 1 ( continued ) 

No. Drug name 

Structure Pharmacological 

function 

25 Dantrolene 

26 Sulfasalazine 

27 Silybin 

Muscle relaxant 

°^C 

„ -JfYx Anti-bacterial 

Hi effect 

JljjL* fy 0H Hepatoprotective 

' ' effect 

(f) Anti-hypertensive 

effect 

28 Nicardipine 

29 Sildenafil 

Y r 

tp 

W C , Treatment of 

dXW erectile 

"j 1 dysfunction 


yl)methyl 2-amino-3-phenylpropanoate (Table 6 and Supporting 
excel file RdRpJSIRxlsx). 

3.2.1.4. Helicase 

Helicase (Nspl3), a multi-functional protein, include N-terminal 
metal binding domain (MBD) and helicase domain (Hel). N-ter¬ 
minal structure contains 26 cysteine residues to form a Zn 2+ 
binding domain and helicase domain with a conserved motif at the 
C-terminus. Nspl3 can unravel double-stranded (ds) DNA and 
RNA along the 5'—3' direction in an NTP-dependent manner’ 7 . 
Importantly, it has been reported that the SARS-Nspl3 sequence 
is conserved and indispensable, and is a necessary component for 
the replication of coronavirus. Therefore, it has been identified as 
a target for anti-viral drug discovery, but there are few reports 
about Nspl3 inhibitors 38 ' 39 . 

Based on structure modeling of hehcase protein, anti-bacterial 
drugs (lymecychne, cefsulodine and rolitetracycline), anti-fungal 
drug itraconazole, anti-human immunodeficiency virus-1 (HTV-1) 
drug saquinavir, anti-coagulant drug dabigatran, and diuretic drug 
canrenoic acid were predicted to be hehcase inhibitors with high 
mfScores through virtual ligand screening. The natural products, 
such as many flavanoids from different sources (a-glucosyl hes- 
peridin, hesperidin, rutin, quercetagetin 6-0-|8-D-glucopyranoside 
and homovitexin), xanthones such as 3,5-dimethoxy-l-[(6-0-/J-D- 
xylopyranosyl-/3-D-glucopyranosyl)oxy]-97/-xanthen-9-one, kouit- 
chenside H, kouitchenside A, 8,2-dihydroxy-3,4,5-trimethoxy-l- 
f(6-0-/3-D-xylopyranosyl-/3-n-g]ucopyranosyl)oxy]-97/-xanthen-9- 
one, kouitchenside D, I -hydroxy-2,6-di methoxy-8-[(6-0-/3-n- 
xylopyranosyl-|3-D-glucopyranosyl)oxy]-977-xanthen-9-one and 
triptexanthoside D from Swertia genus, phyllaemblicin B and 
phyllaemblinol from Hyllantlu c bl a showed high binding 
affinity to this target. 

Besides the above targets, some non-structural proteins, 
including Nsp3b, Nsp3e, Nsp7_Nsp8 complex, Nsp9, NsplO, 
Nspl4, Nspl5, and Nspl6, also play an important role in the virus 
RNA synthesis and replication, suggesting these proteins may be 
useful targets for the anti-viral drug discovery. The virtual 
screening results showed many anti-bacterial, anti-viral, or anti¬ 
inflammatory drugs from ZINC drug database and our in-house 
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Table 2 

Potential PLpro inhibitors from ir 

i-house natural product database. 



No. 

Compound name 

Structure 

Pharmacological 

function 

Source 

1 

Platycodin D 


Anti-tumor, anti¬ 
inflammatory effect 

Platycodon 

grandiflorus 

2 

Chrysin 

BUI 

inflammatory effect 

Scutellaria 

baicalensis 

3 

Neohesperidin 

/y- 

allergic effect 

Citrus aurantium L. 






4 

Baicalin 


Anti-tumor, anti¬ 
inflammatory, anti¬ 

Scutellaria 

baicalensis 




bacterial, anti-vims 
effect 


5 

Sugetriol-3,9-diacetate 


Anti-HBV, 

anti-HSV-1 

Cyperus rotundas 

6 

(—)-Epigallocatechin gallate 

"vr-A” 

Anti-oxidation, anti¬ 

Camellia sinensis 



tumor, treatment of 




depression 


7 

Phaitanthrin D 



Isatis indigotica Fort. 

8 

2-(3,4-Dihydroxyphenyl)-2-[[2- 


Anti-oxidant, anti¬ 

Vitis vinifera 


(3,4-dihydroxyphenyl)-3,4- 

inflammatory, anti- 



dihydro-5,7-dihydroxy- 
2H- 1 -benzopyran-3-yl]oxy]- 
3,4-dihydro-2ff-1 -benzopyran- 
3,4,5,7-tetrol 




9 

2,2-Di (3-indolyl)-3-indolone 

,Jjk 

Anti-vims 

Isatis indigotica Fort. 

10 

(S)-(lS,2R,4aS,5R,8aS)-l- 

?* 

Anti-virus, anti¬ 

Andrographolide 


Formamido-1,4a-dimethyl-6- 

inflammatory effect 

derivatives 


methylene-5-((£)-2-(2-oxo- 

2,5-dihydrofuran-3-yl)ethenyl) 

decahydronaphthalen-2-yl-2- 

amino-3-phenylpropanoate 




11 

Piceatannol 


effect 

Vitis vinifera 

12 

Rosmarinic acid 


Anti-virus, 

anti-oxidant 

Salvia verticillata L. 

13 

Magnolol 


anti-microbial 

effect 

Magnolia officinalis 



natural products/derivatives database displayed potential good 
affinity to these targets, and the detailed information of virtual 
screening results is shown in Supporting excel files (for ZDD 
screening results, file names as target.xlsx; for natural products 
screening results, file names as target_NP.xlsx). 

3.2.2. Targets inhibiting virus structural proteins 
Spike is the main structural protein of coronavirus and assembles 
into a special corolla structure on the surface of the virus as a 
trimer. Spike is a main protein that interacts with the host by 
binding to host cell receptors to mediate virus invasion and 
determine viral tissue or host tropism 40 . Spike is cleaved into SI 
and S2 by the host cell protease like TMPRSS2, etc. The main 


function of SI is to bind with host cell surface receptors, and the 
S2 subunit mediates virus—cell and cell—cell membrane fusion. 
Spike structural integrity and cleavage activation play a key role in 
virus invasion and virulence 41 . Therapeutic strategies to block 
coronavirus from entering host cells by targeting Spike proteins or 
specific receptors on the host surface are valuable for the devel¬ 
opment of anti-viral drugs. 

On the basis of virtual screening results of small-molecule 
compounds against Spike protein, some drugs showed high 
binding affinity (mfScore<—150 or score<—35), such as anti¬ 
hypertensive drugs (rescinnamine, iloprost and prazosin), anti¬ 
fungal drugs (posaconazole and itraconazole), anti-bacterial drug 
(sulfasalazine, azlocillin, penicillin and cefsulodin) and anti- 
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Figure 4 Low-energy binding conformations of ribavirin bound to PLpro generated by molecular docking. (A) Superimposing of 2019-nCoV 
PLpro (yellow) with the crystal structure of SARS PLpro (orange) (PDB code 3e9s). (B) Detailed view of rivavirin binding in the active site of the 
enzyme (Supporting PDB file SARS_CoV-2_PLpro_homo_Ribavirin.pdb). 


coagulant drug dabigatran etexilate. Some natural flavanoids, 
licoflavonol from Glycyrrhiza uralensis, cosmosiin from S. bai- 
calensis, neohesperidin from C. aurantium, mangostin from 
Garcinia rnangostana, kouitchenside D from Swertia kouitchensis, 
excoecariatoxin from Excoecaria agallocha, phyllaemblicin G7 
from P. emblica, and piceatannol from Vitis vinifera, exhibited 
high binding affinity. The detailed virtual screening results are 
shown in Supporting excel files. 

However, most of above compounds were not predicted to 
bind with the binding interface of the Spike—ACE2 complex. The 
only compound that could target the binding interface between 
Spike and ACE2 was hesperidin, as shown in Fig. 6A. Hesperidin 
was predicted to lie on the middle shallow pit of the surface of 
RBD of Spike, where the dihydroflavone part of the compound 
went parallel with the /S-6 sheet of RBD. And the sugar part was 
inserted into the shallow pit in the direction away from ACE2, 
where a few hydrophobic amino acids, including Tyr436, Try440, 
Leu442, Phe443, Phe476, Try475, Try481 and Tyr49 form a 
relatively hydrophobic shallow pocket to contain the compound 
(Fig. 6B). Hydrogen bonding was predicted between Tyr440 and 
the compound. By superimposing the ACE2—RBD complex to 


the hesperidin—RBD complex, a distinct overlap of hesperidin 
with the interface of ACE2 could be observed (Fig. 6C), sug¬ 
gesting hesperidin may disrupt the interaction of ACE2 with 
RBD. 

Except for Spike protein, E protein (E-channel) possesses 
important biological functions for the structural integrity of 
coronavirus and host virulence. NRBD and CRBD of corona- 
virus N protein are needed for N proteins in host cells to bind 
with coronavirus RNA efficiently. Therefore, E protein or N 
protein (NRBD and CRBD domains) can be used as targets for 
the discovery of anti-viral drugs. Through virtual screening, 
many anti-bacterial, anti-viral, anti-tumor, anti-asthmatic, and 
anti-inflammatory drugs, etc. from ZINC database and our in- 
house natural products/derivatives database were found to 
display relatively good affinity to these targets. And the detailed 
results of virtual screening are given in Supplementary excel 
files. 

3.2.3. Targets inhibiting virulence factor 

There are three coronavirus virulence factors Nspl, Nsp3c and 

ORF7a related to interfering host’s innate immunity and assisting 
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Table 3 

Potential 3CLpro inhibitors from ZINC drug database. 


No. 

Drug name 

Structure 

Pharmacological functions 

1 

Lymecycline 


Anti-bacterial effect 

2 

Chlorhexidine 

- Via 

Anti-bacterial effect 

3 

Alfuzosin 

Vo 

Anti-hypertensive agent, benign 




prostatic hyperplasia 

4 

Cilastatin 


Renal peptidase inhibition 

5 

Famotidine 

•*£{3 — 

Anti-ulcerative activity 

6 

Almi trine 


Treatment of cognitive and chronic 



pr 

sensory nerve impairment 



, P 0^X 


7 

Progabide 


Anti-epileptic effect 



cnx 


8 

Nepafenac 


Treatment of pain and inflammation 
associated with cataract surgery 

9 

Carvedilol 


Vasodilator effect 



V 


10 

Amprenavir 


HIV-1 protease inhibition 

11 

Tigecycline 


Anti-bacterial effect 

12 

Demeclocycline 


Anti-bacterial effect 

13 

Monteluk; 


Anti-allergic, anti-asthmatic effects 

14 

Carminic acid 

-56^^ 

Food additive 

15 

Mimosine 


Depilatory effect 

16 

Flavin mononucleotide 

7X6:. 

Electron transference in biological 
oxidation 



-’x 


17 

Lutein 

— rVX 

Vision protection, 
anti-oxidation 

18 

Cefpiramide 

" 6 

Anti-bacterial effect 

19 

Phenethicillin 


Anti-bacterial effect 

s > 0H 
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Table 3 ( continued ) 


No. 

Drug name 

Structure 

Pharmacological functions 

20 

Candoxatril 


Anti-hypertensive effect 

21 

Nicardipine 


Anti-hypertensive effect 

22 

Estradiol valerate 

mm 

Treatment of estrogen deficiency 

23 

Pioglitazone 


Anti-diabetic effect 

24 

Conivaptan 


Treatment of hyponatremia 

25 

Telmisartan 


Anti-hypertensive effect 

26 

Doxycycline 


Anti-bacterial effect 

27 

Oxytetracycline 


Anti-bacterial effect 


coronavirus immune escape. Nspl interacts with host 40S ribo- 
somal subunit that induces specifically host mRNA degradation 42 
and also inhibits type-I interferon production 43 . Nsp3c has ability 
to bind with host’s ADP-ribose to help coronavirus resist host 
innate immunity 44 . Bone marrow matrix antigen 2 (BST-2) can 
inhibit the release of newly-assembled coronavirus from host 
cells. SARS-CoV ORF7a directly binds to BST-2 and inhibits its 
activity by blocking the glycosylation of BST-2 45 . These evi¬ 
dences suggest that Nspl, Nsp3c and ORF7a may be potential 
targets for anti-viral drug discovery. 

The detailed screening results of Nspl, Nsp3c, and ORF7a 
showed that a series of clinical drugs and natural products with 
anti-bacterial and anti-inflammatory effects exhibited relatively 
high binding affinity to these three target proteins, such as 
piperacillin, cefpiramide, streptomycin, lymecycline, tetracycline, 
platycodin D from Platycodon grandifloras, wogonoside from S. 
baicalensis, vitexin from Vitex negundo, andrographolide de¬ 
rivatives, and xanthones from Swertia genus. The detailed results 
of virtual screening are shown in Supporting excel files. 

3.2.4. Targets blocking hose specific receptor or enzymes 
The host ACE2 has been proved by many studies to be the specific 
receptor for the Spike RBD of SARS-CoV 12 . The latest research 
shows that the host receptor of SARS-CoV-2 is consistent with 
SARS-CoV, exhibiting that the Spike RBD sequence of SARS- 
CoV-2 is similar to SARS-CoV RBD and there are important in¬ 
teractions between several key amino acid residues of RBD 
receptor-binding motif and ACE2 24 . Based on the current research 
progress, ACE2 is considered as a host target for the treatment of 
coronavirus infection to block SARS-CoV-2 from entering host 
cells. 

Based on the virtual screening results of ACE2 protein, anti¬ 
diabetes drug troglitazone, anti-hypertensive drug losartan, anal¬ 
gesia drug ergotamine, anti-bacterial drug cefmenoxime, and 


hepatoprotective drug silybin, etc., were predicted to bind with 
ACE2 with low energy. The natural products, such as phyllaem- 
blicin G7 from P. emblica, xanthones from the plants of Swer¬ 
tia genus, neohesperidin and hesperidin from C. aurantium, 
exhibited potentially high binding affinity to ACE2 protein. 
However, none of above ACE2 binding compounds was predicted 
to target the ACE2—RBD interface. 

In addition, TMPRSS2 was known to cut the Spike to trigger 
the infection of SARS-CoV and MERS-CoV. Studies have shown 
that inhibiting the enzyme activity of TMPRSS2 can prevent some 
coronaviruses from entering host cells 46 . As a possible target for 
anti-viral drug discovery, the virtual screen results (shown in 
Supporting excel files) predicted many anti-bacterial drugs (piv- 
ampicillin, hetacillin, cefoperazone and clindamycin) and anti¬ 
virus natural compounds (phyllaemblicin G7, neo- 
andrographolide, kouitchenside I), etc. to be potential TMPRSS2 
inhibitors. 

3.3. Virtual screening and target identification of common anti¬ 
viral drugs 

In order to further verify the screening results of ZINC drug 
database and utilize the current resource of anti-viral drugs 
immediately, we constructed a database of 78 anti-viral drugs for 
deep calculation, including compounds already on the market and 
currently undergoing clinical trials to treat SARS-CoV-2 in¬ 
fections. The compounds were docked with 19 constructed targets 
of SARS-CoV-2 and two human targets to predict their possible 
targets, and also to search possible binding partners of certain 
important targets. Special attentions were paid to the drugs 
currently in clinical trials. All results of calculated docking scores 
were listed in the Supporting excel file (anti-virals_vs_targets_- 
scores), significant scores (score<—32, mfScore<—110) were 
highlighted in brighten yellow. 
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Table 4 

No. 




6 




9 


10 


12 


13 


14 


Potential 3CLpro inhibitors from in-house natural product database. 

Drug name Structure Pharmacological 

functions 


(lS,2R,4aS,5/?,8aS)-l-Formamido- 
1,4a-dimethyl-6-methylene-5- 
((E)-2-(2-oxo-2,5-dihydrofuran-3- 
yl)ethenyl)decahydronaphthalen- 
2-yl 5-((/?)-l,2-dithiolan-3-yl) 
pentanoate 
Betulonal 


Chrysin-7-0-/3-glucuronide 


Andrographiside 


(lS,2R,4aS,5/?,8aS)-l-Formamido- 
1,4a-dimethyl-6-methylene-5- 
((E)-2-(2-oxo-2,5-dihydrofuran-3- 
yl)ethenyl)decahydronaphthalen- 
2-yl 2-nitrobenzoate 
2/3-Hydroxy-3,4-j«:o- 
Medelolactone-27-oic acid 


(S)-(lS,2R,4aS,5R,8aS)-l- 
Formamido-1,4a-dimethyl-6- 
methylene-5-((£)-2-(2-oxo-2,5- 
dihydrofuran-3-yl)ethenyl) 
decahydronaphthalen-2-yl-2- 
amino-3-phenylpropanoate 
Isodecortinol 


Cerevisterol 


Flesperidin 


Neohesperidin 


Andrograpanin 


2-((lR,5i?,6«,8aS)-6-Hydroxy-5- 
(hydroxymethyl)-5,8a-dimethyl-2- 
methylenedecahy dronaphthalen-1 - 
yl)ethyl benzoate 

Cosmosiin 



H 






n 

/o 



Anti-vims, anti¬ 
inflammatory effect 


Anti-vims, anti¬ 
inflammatory effect 

inflammatory effect 


Anti-vims, anti¬ 
inflammatory effect 



Anti-vims, anti¬ 
inflammatory effect 


Anti-inflammatory, 
anti-oxidant effect 


allergic effect 


Anti-vims, anti¬ 
inflammatory effect 


inflammatory effect 


Anti-inflammatory, 
anti-oxidant, anti- 
HIV effect 


Source 

Andrographolide derivatives 

Cassine xylocarpa 

Scutellaria baicalensis 

Andrographis paniculata 

Andrographolide derivatives 

Viola diffusa 

Andrographolide derivatives 

Viola diffusa 

Viola diffusa 

Citrus aurantium L. 

Citrus aurantium L. 

Andrographis paniculata 

Andrographolide derivatives 

Scutellaria baicalensis 
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Table 4 ( continued ) 


No. 

Drug name 

Structure 

Pharmacological 

functions 

Source 

15 

Cleistocaltone A 


Anti-virus 

Cleistocalyx operculatus 

16 

2,2-Di(3-indolyl)-3-indolone 


Anti-virus 

Isatis indigotica Fort. 

17 

Biorobin 

‘Wf’ 


Ficus benjamina 

18 

Gnidicin 


Anti-tumor 

Gnidia lamprantha 

19 

Phyllaemblinol 


Anti-virus 

Phyllanthus emblica 

20 

Theaflavin 3,3'-di-0-gallate 

jS N&. 

Anti-oxidant effect, 

Camellia sinensis 

21 

Rosmarinic acid 


Anti-viral, 

anti-oxidant 

Salvia verticillata L. 

22 

Kouitchenside I 


inflammatory effect 

Swertia kouitchensis 

23 

Oleanolic acid 


inflammatory effect 

Swertia kouitchensis 

24 

Stigmast-5-en-3-ol 

H °V<^ 

A 

Anti-virus, anti¬ 
inflammatory effect 

Swertia binchuanensis 

25 

Deacetylcentapicrin 

jtr 

*j6xc 

Anti-virus, anti¬ 
inflammatory effect 

Swertia macrosperma 

26 

Berchemol 


Anti-virus, anti¬ 
inflammatory effect 

Swertiabi maculata 


Remdesivir (GS-5734), a nucleoside analogue, is an RdRp in¬ 
hibitor. It can inhibit virus by inhibiting synthesis of viral nucleic acid 
and has not yet been approved for marketing in any country. On 
January 31 of this year, the New England Journal of Medicine re¬ 
ported the diagnosis and treatment of the first SARS-CoV-2 patient in 
the United States 47 . Remdesivir showed some potential in the treat¬ 
ment of the first patient with novel coronavirus infection. On Vero E6 
cells, the EC 50 of remdesivir for SARS-CoV-2 is 0.77 pmol/L, and the 
selection index SI is greater than 129 48 . Remdesivir is currently un¬ 
dergoing a randomized, double-blind, controlled phase III clinical 
study in China. Our molecular docking results show that the potential 
targets of remdesivir is Nsp3b (score = —36.5), RdRp (mfScores 
= -112.8), E-channel (mfScore = -125.1), and TMPRSS2 
(score = —36.23, mfScores = —109.4). 

For RdRp, from generated docking model, remdesivir could 
bind to the RNA-binding channel of the SARS-CoV-2 RdRp, and 


the binding mode and site were highly similar to that of cox¬ 
sackievirus B3 (CVB3) RdRp inhibitor GPC-N114 49 .The binding 
pocket of remdesivir was in the bottom of the RNA template 
channel, which position was for the acceptor template nucleotide 
(Fig. 7A and B). The compound was well fitted with the shape of 
the pocket, where it formed three hydrogen bonds with Asn497, 
Arg569 and Asp684. In addition, hydrophobic interactions with 
Leu576, Ala685 and Tyr687 may further direct the favorite 
conformation of remdesivir (Fig. 7C). 

Interestingly, remdesivir was predicted to bind with the target 
TMPRSS2 with low binding energy for both score and mfScore. 
As shown in Fig. 8A, remdesivir was bound in a relatively 
positively-charged allosteric pocket which is far away from the 
enzyme active center. Asn84 and Arg405 formed two hydrogen 
bonds with the phosphate groups of the compound. Weak hydro- 
phobic interaction between the pyrrolotriazine ring of remdesivir 
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Figure 5 Low-energy binding conformations of Montelukast bound to 3CLpro generated by molecular docking. (A) Montelukast was fitted 
well in the active pocket of SARS-CoV-2 3CLpro, 3CLpro was shown as electrostatic surface model. (B) Detailed view of montelukastin binding 
in the active pocket of 3CLpro (Supporting PDB file SARS_CoV-2_3CLpro_homo_Montelukast.pdb). 


with Tyrl31 and Try401, and side chains of some polar amino 
acids may further stabilize the compound conformation (Fig. 8B). 

Lopinavir and ritonavir have been marketed in China and are 
mainly used to treat HIV-1 infection in adults and children over 2 
years of age. In vitro studies have shown that lopinavir and rito¬ 
navir can inhibit the replication of MERS-CoV and SARS-CoV to 
exert anti-viral effects. The drug is listed as a recommended drug 
in the Anti-viral Drugs section of the New Coronavirus Infected 
Pneumonia Diagnosis and Treatment Program. The molecular 
docking results showed that ritonavir’s possible target is Nsp3c or 
E-channel with the mfScores of —152.100 and —277.769, 
respectively. Lopinavir’s possible target is Nsp3b, Nsp3c, helicase, 
NRBD or E-channel with the mfScores of —158.050, —189.140, 
-114.018, -171.127, and -221.785, respectively. 

Arbidol is a broad-spectrum anti-viral drug, mainly for the 
treatment of upper respiratory tract infections caused by influenza 
A and B viruses, etc. In recent years, many studies have proven its 
effectiveness against both SARS-CoV and MERS-CoV. Arbidol 
hydrochloride can block virus replication by inhibiting the fusion 
of the lipid membrane of the virus with the host cells. Compared 
with the untreated control group, arbidol can effectively inhibit 
coronavirus up to 60 times at a concentration of 10—30 pmol/L, 
and significantly inhibit the virus’s pathological effects on cells. 
The docking results of arbidol with the possible drug targets of the 
new coronavirus showed that it may interact with Nsp7_Nsp8 
complex, Nspl4, Nspl5, E-channel, or Spike with the mfScores of 
-136.087, -118.253, -118.253, -117.879, and -145.125, 
respectively. 

Darunavir is an HIV-1 protease inhibitor that selectively in¬ 
hibits the cleavage of HIV-encoded Gag-Pol polyprotein in virally 
infected cells, thereby preventing the formation of mature infec¬ 
tious virus particles. At a concentration of 300 pmol/L, darunavir 
can significantly inhibit virus replication, and the inhibition effi¬ 
ciency is 280 times compared with the untreated group. Our 
docking results showed that the possible targets of darunavir are 
Nsp3c, PLpro, E-channel or Spike proteins with the mfScores of 
-126.149, -110.759, -157.184, and -111.865, respectively. 

Favipiravir, a broad-spectrum anti-viral drug, is used to treat 
flu. The Shenzhen Health Commission has now initiated clinical 
studies on the use of favipiravir to treat SARS-CoV-2 infections. 


While, the scores of favipiravir docking with the targets in our 
virtual screening are relatively low. 

Chloroquine phosphate has been used in the treatment of ma¬ 
laria since the 1940s and later in rheumatoid arthritis. Chloroquine 
has been reported in some earlier studies to have direct anti-viral 
effects, such as inhibiting flavivirus, retrovirus (such as HIV), and 
many coronaviruses. Recent study showed that with EC 50 of 
1.13 pmol/L and SI greater than 88, chloroquine can effectively 
inhibit SARS-CoV-2 in the cell level. Its efficiency in the human 
body for SARS-CoV-2 infection has not yet been clinically 
proven. Chloroquine phosphate was turned into chloroquine in the 
body to play therapeutic effect. Our docking results showed that 
the possible target of chloroquine is Nsp3b or E-channel with the 
docking mfScores of —130.355 and —107.889, respectively. 

We also performed the longitudinal analysis on the drugs 
against 21 targets, and the results showed that only tenofovir, 
disoproxil, fumarate, and beclabuvir may bind to Nspl. Fewer 
compounds were predicted to act on some targets, such as Nspl, 
Nsp3e, Nsp9, NsplO, Nspl6, NRBD, CRBD, ORF7a, and 
TMPRSS2, showing that these selected anti-viral drugs are un¬ 
likely to acting on the above targets of the new coronavirus, which 
provides a meaningful reference for our future research. 

For other targets, such as Nsp3b, Nsp3c, Nsp7_Nsp8 complex, 
Nspl4, Nspl5, PLpro, 3CLpro, RdRp, helicase, E-channel, Spike 
and ACE2, more anti-viral drugs were predicted to bind with 
them, especially E-channel, RdRp, 3CLpro and PLpro, indicating 
that these targets are more likely to be useful for the discovery of 
SARS-CoV-2 therapeutic drugs from known anti-virals, and 
should be the focus of our subsequent research. 


4. Discussion 

The ongoing SARS-CoV-2 epidemic makes us painfully realize 
that our current options for treating life-threatening zoonotic 
coronavirus infections are very limited. Although the outbreaks of 
SARS in 2003 and MERS-CoV in 2012 triggered extensive 
research efforts, there are currently no drugs that can treat any 
zoonotic coronavirus. The transient nature of this epidemic is one 
of the major reasons why no prototype coronavirus inhibitor has 
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Table 5 

Potential RdRp inhibitors from ZINC drug database. 


No. 

Drug name 

Structure 

Pharmacological function 

1 

Valganciclovir 


Anti-virus 

2 

Chlorhexidine 

-Yi?tx 

Anti-bacterial effect 

3 

Ceftibuten 


Anti-bacterial effect 

4 

Fenoterol 

H ^Cnr^ r “ 

Treatment of bronchial asthma 

5 

Fludarabine 


Anti-tumor 

6 

Itraconazole 


Anti-fungal effect 

7 

Cefuroxime 


Anti-bacterial effect 

» 

Atovaquone 

-0-0 

Anti-malaria 

9 

Chenodeoxycholic acid 


Dissolving gallstones 

10 

Cromolyn 


Treatment of allergic asthma 

11 

Pancuronium bromide 


Muscle relaxant effect 

12 

Cortisone 

.. . 1*#* 

Anti-allergic effect 

13 

Tibolone 


Contraceptive effect 

14 

Novobiocin 


Anti-bacterial effect 

15 

Silybin 


Hepatoprotective effect 

16 

Idambicin 

J t&0 

Anti-tumor 

17 

Bromocriptine 


Treatment of amenorrhea 

18 

Diphenoxylate 

xx 

Treatment of diarrhea and chronic enteritis 

19 

Benzylpenicilloyl G 


Skin-testing reagent to detect penicillin allergy 

20 

Dabigatran etexilate 


Anti-coagulant effect 
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Table 6 Potential RdRp inhibitors from in-house natural product database. 

No. Drug name Structure Pharmacological 

function 


1 Betulonal 


3 2/3,30/3-Dihydroxy-3,4-reco- 

friedelolactone-27-lactone 

4 14-Deoxy-11,12- 

didehydroandrographolide 


5 Gniditrin 


6 Theaflavin 3,3'-di-0-gallate 


7 (R)-((lR,5aS,6R,9aS)-l,5a- 

Dimethyl-7-methylene-3-oxo-6- 
((E)-2-(2-oxo-2,5-dihydrofuran-3- 
yl)ethenyl)decahydro-17f-benzo 
[c]azepin-l-yl)methyl 2-amino-3- 
phenylpropanoate 

8 2/?-Hydroxy-3,4 -seco- 

friedelolactone-27-oic acid 

9 2-(3,4-Dihydroxyphenyl)-2- [[2-(3,4- 

dihydroxyphenyl)-3,4-dihydro- 
5,7 -dihydroxy-2ff-1 -benzopyran- 
3-yl|oxy|-3,4-dihydro-2//-1 - 
benzopyran-3,4,5,7-tetrol 

10 Phyllaemblicin B 


11 14-Hydroxycyperotundone 


12 Andrographiside 


13 


14 


2-((lR,5R,6R,8aS)-6-Hydroxy-5- 
(hydroxymethyl)-5,8a-dimethyl-2- 
methylenedecahydronaphthalen-1 - 
yl)ethyl benzoate 

Sugetriol-3,9-diacetate 



Jki 



Anti-HIV-1 


Anti-virus, anti¬ 
inflammatory effect 


Anti-oxidant, anti- 



Anti-virus, anti¬ 
inflammatory effect 


Anti-oxidant, anti¬ 
inflammatory, anti- 


Anti-HBV 


Anti-virus, anti¬ 
inflammatory effect 


Anti-virus, anti¬ 
inflammatory effect 


Anti-HBV 


Source 


Cassine xylocarpa 


Gnidia lamprantha 


Androgrciphis paniculata 


Gnidia lamprantha 


Camellia sinensis 


Andrographolide derivatives 


Vitis vinifera 


Phyllanthus emblica 


Cyperus rotundas 


Androgra phispaniculata 


Andrographolide derivatives 


Cyperus rotundus 
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Table 6 ( continued ) 


No. 

Drug name 

Structure 

Pharmacological 

function 

Source 

15 

Baicalin 

fp* 

Anti-tumor, anti¬ 
inflammatory, anti¬ 
bacterial, 
anti-virus effect 

Scutellaria baicalensis 

16 

(lS,2R,4aS,5R,8aS)-l-Formamido- 


Anti-virus, anti¬ 

Andrographolide derivatives 


1,4a-dimethyl-6-methylene-5 - 
((£)-2-(2-oxo-2,5-dihydrofuran-3- 
yl)ethenyl)decahydronaphthalen- 
2-yl 5-((/?)-i ,2-dithiolan-3-yl) 
pentanoate 

Y H 

inflammatory effect 


17 

1,7-Dihydroxy-3-methoxyxanthone 

"XCOo 

inflammatory effect 

Swertia pseudochinensis 

18 

l,2,6-Trimethoxy-8-[(6-0-d-D- 

HO ?H 

Anti-virus, anti¬ 

Swertia mussotii 


xylopyranosyl-fJ-D- 

glucopyranosyl)oxy]-97f-xanthen- 

9-one 

0? 

*0 . <4C 

inflammatory effect 


19 

1,8-Dihydroxy-6-methoxy-2- [(6-0- 
/3-o-xylopyranosyl-d-D- 
glucopyranosyl)oxy]-97f-xanthen- 
9-one 


inflammatory effect 

Swertia kouitchensis 


20 

8-(/3-D-Glucopyranosyloxy)-l ,3,5- 

H( S 

Anti-virus, anti¬ 

Swertia mussotii 


trihydroxy-9W-xanthen-9-one 

o„ „ xc 

inflammatory effect 








progressed to the (early) preclinical stage to date. As with the 
SARS virus 17 years ago and the current SARS-CoV-2, emerging 
coronaviruses in the future may continue to pose a threat to global 
public health. Therefore, finding broad-spectrum inhibitors that 
may reduce the effects of human coronavirus infection remains a 
challenging research focus. Given the time-consuming nature of 
anti-viral drug development and registration, existing treatments 
for other diseases may be the only fastest treatment option for 
emerging infectious diseases. For most of these drugs that have 
been prepared, the medication has sufficient experience and 
dosage, and their safety and ADME situation are well known. 

In this study, based on the results of bioinformatics analysis, 20 
homology structures of 18 SARS-CoV-2 proteins and one human 
protein were built, plus human ACE2 structure, totally 21 targets 
were setup for high throughput virtual ligand screening. The 
special double scoring system ICM-Pro soft allowed us to evaluate 
the docking results more accurately. The score of ICM software is 
calculated by the overall empirical function of predicted physical 
interaction, whiles the mfScore (mean force Score) is computed 
by the knowledge-based potential functions derived from statistics 
of ligand—receptor complex in PDB 50,51 . 

From protein—protein docking results, we can deduce that 
SARS-CoV-2 Spike protein have strong binding affinity to 
human ACE2, although weaker than that of SARS-CoV Spike 
protein. Unexpectedly, Bat-CoV RaTG13 Spike protein only has 
slightly weaker binding affinity with ACE2 compared to SARS- 
CoV-2. It is speculated that the virus may be capable of infecting 
humans directly, or it is close. If SARS-CoV-2 evolved from Bat- 
CoV RaTG13, there may be no more than one or two host 
evolutions involved. But there are no published articles that 
directly related to intermediate hosts yet. According to the most 


recent research paper on the clinical characteristics of SARS- 
CoV-2 infection 52 , SARS-CoV-2 can spread rapidly by human- 
to-human transmission, this was consistent with observed 
protein—protein docking results of SARS-CoV-2 Spike RBD 
and human ACE2, for both of ours and previous prediction 25 . 
Flowever, the fast growing incidents of SARS-CoV-2 pneumonia 
and related deaths implied the weaker binding of SARS-CoV-2 
Spike with human ACE2 compared to SARS Spike may not 
fully explained the current situation of this epidemic. There must 
have some other receptors of SARS-CoV-2 or infection facili¬ 
tating mechanisms in the human host, or SARS-CoV-2 Spike- 
RBD may take other lower energy binding conformation 
different from current SARS-CoV Spike-RBD conformation 
found in the crystal structure, probably by induce fitting. These 
need to be further investigated. 

To search potential coronavirus therapeutic drugs as soon as 
possible, we first screened potential compounds from a ZINC drug 
database (2924 compounds) and a small in-house database of 
natural products (about 1066 compounds). A series of clinical 
drugs and natural products with anti-viral, anti-bacterial and anti¬ 
inflammatory effects exhibited high binding affinity to different 
target proteins, indicating their potential utility for treating SARS- 
CoV-2. 

We have identified a number of compounds that might have anti¬ 
viral activity from the approved drugs library, such as anti-virus 
drugs (ribavirin, valganciclovir and thymidine), anti-bacterial drugs 
(cefpiramide, sulfasalazine, phenethicillin, lymecycline, demeclo- 
cycline, doxycycline, oxytetracycline and tigecycline), anti¬ 
asthmatic drugs (montelukast, fenoterol and reproterol), and hep- 
atoprotective drug silybin. The original pharmacological actions of 
these drugs could be helpful for the therapy of viral infection 
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Figure 6 Low-energy binding conformation of hesperidin bound to Spike RBD generated by molecular docking. (A) Hesperidinwas fitted into 
the shallow pocket in the surface of SARS-CoV-2 Spike RBD. (B) Detailed view of hesperidinbinding in the pocket of Spike RBD. (C) Hesperidin 
blocks the interface of ACE2 and Spike RBD binding (Supporting PDB file SARS_CoV-2 _Spike_RBD_homo_Hesperidin.pdb). 


pneumonia. The natural products, such as flavanoids like neo- 
hesperidin, hesperidin, baicalin, kaempferol 3-O-rutinoside and 
rutin from different sources, andrographolide, neoandrographolide 
and 14-deoxy-ll,12-didehydroandrographolide from A. pan- 
iculata, and a series of xanthones from the plants of Swertia genus, 
with anti-virus, anti-bacteria and anti-inflammation activity could 


effectively interact with these targets of SARS-CoV-2. Therefore, 
the herbal medicines containing these compounds as major com¬ 
ponents might be meaningful for the treatment of SARS-CoV-2 
infections. 

For ACE2 target, although several compounds could bind 
with ACE2 through virtual screening in our studies, no 
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Figure 7 Low-energy binding conformation of remdesivir bound to SARS-CoV-2 RdRp generated by molecular docking. (A) Remdesivir was 
fitted in the bottom of the RNA template channel (top view). (B) Remdesivir was fitted in the bottom of the RNA template channel (bottom view). 
(C) Detailed view of remdesivir binding with RdRp (Supporting PDB file SARS_CoV-2_RdRp_homo_Remdesivir.pdb). 


compound was found to bind with the contact surface of 
ACE2—Spike complex, suggesting that these compounds are 
only the inhibitors of ACE2 enzyme activities, rather than in¬ 
hibitors of ACE2 driven virus infections. Just like what described 
in recently published research 53 , most of selected compounds are 
also unable to bind with the contact surface of ACE2—Spike 
complex. Actually, these potential ACE2 inhibitors may not be 
suitable to use as drugs for treating SARS-CoV-2 infection 
because the poor prognosis would be induced by the inhibition of 
ACE2 enzyme activities, for ACE2 was considered as a protec¬ 
tive factor of lung injury 54 . 

For those targets which are difficult to find direct inhibitors, or 
non-druggable targets, just like Nspl, Nsp3b, Nsp3c, and 
E-channel, etc., currently popular PROTAC technology may be a 
good strategy to degrade these proteins and then inhibit the virus. 
The potential binding compounds found in this study for these 
targets might be a good start point. 

For Spike protein, we found only one compound, natural hes- 
peridin was targeting the binding between Spike RBD and human 
ACE2. However, not like the ACE2 binding compounds, non¬ 
interface binding compound may still meaningful applications, 
considering that the fusion of CoVs membrane with host cell mem¬ 
brane need the big conformational change of remained Spike part 
after RBD removal 55 . Any small molecule bound to Spike at this time 
may interfere the re-folding of Spike therefore inhibits the viral 
infection process. Furthermore, small molecule that can target any 
part of Spike protein may be a good start point to design PROTAC 
based therapy. 


Also, we dock existing anti-viral drugs with our targets, 
analyze the possible targets of each anti-viral drug horizontally, 
and analyze the drugs that may interact with 21 targets vertically. 
We analyzed 21 targets based on the docking results and found 
that Nsp3b, Nsp3c, Nsp7_Nsp8 complex, Nspl4, Nspl5, PLpro, 
3CLpro, RdRp, helicase, E-channel, Spike and ACE2 are more 
likely to be therapeutic targets of anti-viral drugs. The three tar¬ 
gets Nsp3b, Nsp3c, and E-channel are screened more anti-viral 
drugs. This may be due to the model problem because of flexible 
small protein (Nsp3b and Nsp3c) or partial model (E-channel). 
Whether the screened anti-viral drugs really work on these targets 
needs further verification. We also do not recommend the appli¬ 
cation of new coronavirus pneumonia to compounds for which no 
target has been predicted. 

The triphosphate nucleotide product of remdesivir, remdesivir- 
TP, competes with RdRp for substrate ATP, so it can interfere with 
viral RNA synthesis. Our docking results show that remdesivir-TP 
binds to SARS-CoV-2 RdRp, with a score of —112.8, and the 
docking results are consistent with its original anti-viral mecha¬ 
nism, so we think remdesivir may be good in treating SARS-CoV- 
2 pneumonia. In addition, remdesivir also predicted to bind with 
the human TMPRSS2, a protein facilitating the virus infection, 
this is a new discovery and provides ideas for subsequent research. 

Chloroquine phosphate has shown better anti-SARS-CoV-2 
effects in recent studies, but this drug has no clear target of action. 
In our docking results, chloroquine phosphate is predicted to 
possibly combine with Nsp3b and E-channel. But we need to do 
further experiments to verify this conclusion. 
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Figure 8 Low-energy binding conformation of remdesivir bound to human TMPRSS2 generated by molecular docking. (A) Remdesivir was 
fitted in an allosteric pocket far away from the enzyme active center (bottom). (B) Detailed view of remdesivir binding with TMPRSS2 (Sup¬ 
porting PDB file HS_TMPRSS2_homo_Remdesivir.pdb). 


In response to the recently reported anti-AIDS drugs lopinavir 
and ritonavir tablets, which have a poor effect on the treatment of 
novel coronavirus pneumonia and have toxic side effects, we 
analyzed it in conjunction with the docking results. The molecular 
docking results show that ritonavir’s possible target is Nsp3c or E- 
channel. Lopinavir’s possible target is Nsp3b, Nsp3c, helicase, 
NRBD or E-channel. Some of these targets (such as Nsp3b, 
Nsp3c, E-channel) may be false positives due to the model inac¬ 
curacy for small flexible protein or partial model. For both lopi¬ 
navir and ritonavir, we did not observe possible binding to major 
targets like 3CLpro, PLpro, RdRp, and so on. This docking result 
implies lopinavir and ritonavir tablets may not be suitable for 
treatment of SARS-CoV-2 infections. 

The results of the entire article are based on computer virtual 
screening. We have not conduct further in vivo and in vitro anti¬ 
viral experiments yet, because we want to share our results with 
scientists in anti-SARS-CoV-2 research as soon as possible. Our 


subsequent research will try to solve the three-dimensional 
structures of all 24 proteins of SARS-CoV-2 and their drug 
complexes, providing more target information for drug interven¬ 
tion and long-term drug design, perform in vivo and in vitro 
evaluations for candidate drugs obtained in this study, and prepare 
for clinical trial applications. 
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